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< ! Start Fragment -->RESULT 4 
AB007453 

AB007453 3500 bp mRNA linear VRT 08-JUN-2002 

Chrysophrys major mRNA for estrogen receptor, complete cds . 
AB007453 

ABO-07453.1 GI: 2447037 
estrogen receptor. 
Pagrus major (red seabream) 
Pagrus major 

Eukaryota ; Metazoa ; Chordata ; Craniata ; Vertebrata ; Euteleostomi ; 
Actinopterygii ; Neopterygii ; Teleostei ; Euteleostei ; Neoteleostei ; 
Acanthomorpha ; Acanthopterygii ; Percomorpha; Perciformes; 
Percoidei; Sparidae; Pagrus. 
1 

Touhata , K . , Kinoshi ta , M . , Toyohara , H . and Sakaguchi , M . 
Sequence and expression of a cDNA encoding the red seabream 
estrogen receptor 
Fisheries Sci. 64, 131-135 (1998) 
2 (bases 1 to 3500) 
Touhata, K. and Toyohara, H. 
Direct Submission 

Submitted ( 19-SEP-1997). Ken Touhata, Japan Society for the 
Promotion Science, National Research Institute of Fisheries 
Science; 2-12-4 Fukuura, Kanazawa-ku, Yokohama, Kanagawa 236-8648, 
Japan (E-mail : ktouhataOf ra .affrc .go. jp, Tel : 81-45-788-7685, 
Fax:81-45-788-5001) 

Location/Qualifiers 
1. .3500 

/organism=" Pagrus major" 
/mol_type= " mRNA" 
/db_xref="taxon: 143350" 
/tissue_type=" liver" 
/note=" Pagrus major" 
207. .1952 
/codon_start=l 

/product^" estrogen receptor" 
/pro te in_i d= " BAA2 2517.1" 
/db_xref ="GI : 244703 8" 

/trans la tion="MYPEDSRGSGGVATVDFLEGTYDYAAPTPAPTPLYSHSTPGYYS 
APLDAHGPPSDGSLQSLGSGPNSPLVFVPSSPRLSPFMHPPTHHYLETTSTPVYRSSV 
PSSQQSVSREDQCGTSDDSYSVGESGAGALAAGFEIAKEMRFCAVCSDYASGYHYGVW 
SCEGCKAFFKRSIQGHNDYMCPATNQCTIDRNRRKSCQACRLRKCYEVGMMKGGMRKD 
RGRVLRRDKQRTGTSDRDKAS KGLEHRTAPPQDRRKH I S S S AGGGGGKS SMI SMP PDQ 
VLLLLQGAEPPMLCSRQKLNRPYTEWMMTLLTSMADKELVHMIAWAKKLPGFLQLSL 
HDQVQLLESSWLEVLMIGLIWRSIHCPGKLIFAQDLILDRSEGDCVEGMAEIFDMLLA 
TASRFRMLKLKPEEFVCLKAIILLNSGAFSFCTGTMEPLHDGAAVQNMLDTITDALIH 
HINQSGCSAQQQSRRQAQLLLLLSHIRHMSNKGMEHLYSMKCKNKVPLYDLLLEMLDA 
HR I HRADRPAETWSQADRE PPFTSRNS SGGGGGGGGGS S S AGSTSGPRVSHE S PTS PG 
VLQYGGSRSECTHIL" 
3500 

/note="25 A nucleotides" 
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ACCESSION 
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REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
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AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



CDS 



polyA_site 



ORIGIN 



Query Match 8 0.0%; 

Best Local Similarity 89.0%; 
Matches 1564; Conservative 



Score 1396.8; 
Pred . No . 0 ; 
0 ; Mismatches 



DB 11; Length 3500; 
167; Indels 27; Gaps 



4; 



Db 



1 ATGTACCCTGAAGAGAGCAGGGGGTCCGGAGGGGTAGCCACTGTGGACTTTCTGGAAGGG 60 

MINIM MMI Ml 1 1 II 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 IIIIMIII 

2 07 ATGTACCCCGAAGACAGCCGGGGGTCTGGAGGGGTAGCCACTGTGGACTTCCTGGAAGGG 266 
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Qy 61 ACCTACGATTATGCCGCCCCCACCCCTGCCCCGACTCCTCTTTACAGCCAGT CT 114 

II 1 1 1 II II 1 1 II I II II II 1 1 1 1 1 i 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 I II 

Db 267 ACGTACGACTATGCCGCCCCCACCCCTGCCCCGACTCCTCTTTATAGCCACTCCACCCCT 326 

Qy 115 GGCTACTACTCTGTACCTCTGGACGCCCAAGGGCCACCCTCAGATGGCAGCCTTCAGTCC 174 

1 1 II 1 1 1 1 1 1 II I INI I MINI I II II Mill I Mil I III 1 1 Mill I MM 

Db 32 7 GGCTACTACTCTGCTCCTCTGGACGCCCACGGACCACCCTCAGATGGGAGCCTTCAGTCC 38 6 

Qy 175 CTGGGCAGCGGGCCTACCAGTCCTCTTGTGTTTGTGCCGTCCAGCCCCAGACTCAGCCCC 234 

Ml MMMM Ml I II Ml MM I III I MM 1 1 Mill I Ml I II MUM 

Db 3 87 GTGGGCAGCGGGCCCAACAGTCCTCTAGTGTTTGTGCCCTCCAGCCCCCGTCTTAGCCCC 446 

Qy 2 35 TTTATGCACCCGCCCAGCCACCACTATCTGGAAACCACCTCAACACCCGTCTACAGGTCT 294 

II 1 1 1 II II 1 1 II M 1 1 1 1 II M I M 1 1 1 II I II II 1 1 1 1 II II 1 1 II II 1 1 II II I 

Db 447 TTCATGCACCCGCCCACCCACCACTATCTGGAAACCACCTCAACACCCGTCTACAGGTCC 506 

Qy 295 AGTGTCCCATCCAGCCAGCAGCCAGTTCCCAGAGAGGACCAGTGTGCCACCAGTGATGAG 3 54 

I II Mill I MM I MUM Mill I 1 1 MUM I Mill 1 1 III I Mill II 

Db 507 ^GTGTCCCATCCAGTCAGCAGTCAGTTTCAAGAGAGGACCAGTGTGGCACCAGTGACGAC 566 
Qy 355 TCCTATAGTGTGGGGGAGTCAGGGGCTGG AGCCAGGGGGTTTGAGATGGCCAAG 4 08 

M il II Mill llllllll II II III MMMM III Mill 

Db 567 TCATACAGCGTGGGAGAGTCAGGTGCCGGAGCGTTAGCGGCTGGGTTTGAGATTGCCAAA 626 

Qy 4 09 GAGATGCGTTTCTGTGCTGTGTGCAGTGACTATGCCTCTGGGTACCACTACGGGGTGTGG 468 

i i iiiiiiiiiiiiii iiiMiii ii iiiiiiiiiiiiiiiii IMIM MMII 

Db 627 GAAATGCGTTTCTGTGCCGTGTGCAGCGATTATGCCTCTGGGTACCATTACGGGGTGTGG 686 

Qy 469 TCCTGTGAAGGCTGTAAGGCCTTCTTTAAGAGGAGCATCCAGGGTCACAATGACTATATG 528 

llllllll INN I I I 1 I I I I I I I I I i I I I 1 1 I 1 I I I I I I I M 1 I I I I I 1 1 I I I I M 
Db 687 TCCTGTGAGGGCTGCAAGGCCTTCTTTAAGAGGAGCATACAGGGTCACAATGACTATATG 74 6 

Qy 529 TGCCCAGCAACCAATCAGTGTACTATTGACAGGAATCGGAGAAAGAGCTGCCAGGCTTGC 58 8 

- M II II II 1 1 1 M 1 1 1 M I M 1 1 1 M II M 1 1 M 1 1 M I M 1 1 II M 1 1 1 1 M 1 1 M 1 1 

Db 74 7 TGCCCAGCAACCAATCAGTGTACTATTGACAGGAATCGGAGGAAGAGCTGCCAGGCTTGC 80 6 

q v 589 CGTCTTAGGAAGTGTTATGAAGTGGGCATGATGAAAGGAGGTGTTCGCAAGGACCGTGGC 64 8 

MMMMMMIMM IIIIIIIIMIIIMIMIMIII I IIIIIIIIIIIIII 

Db 8 07 CGTCTTAGGAAGTGTTACGAAGTGGGCATGATGAAAGGAGGTATGCGCAAGGACCGTGGG 86 6 

Q V 64 9 CGTGTTTTGCGCCGTGATAAACGACGTGCTGGAACCAATGACCGAGAGAAGGCCTCTAAG 70 8 

M Ml MM Mill MM III MM MM MM MM 1 1 1 II 1 1 M I 

Db 8 67 CGCGTTCTGCGGCGTGACAAACAACGGACTGGCACCAGTGACAGAGACAAGGCCTCTAAG 92 6 

q v 7 09 GACCTGGAGTACAAAACAGTGCCCCCTCAGGACAGGAGGAAACACAGCAGCAGCAGCAGT 768 

I MUM Ml Mill I III 1 1 Ml I M 1 1 M 1 1 Ml Ml M IMIMIIM 

Db 927 GGTCTGGAGCACAGAACAGCGCCCCCTCAGGACAGGAGGAAACA- - - CAT C AGC AGCAGT 983 

q v 769 GCCGGTGGTGGAGGAGGAAAATCATCAGTGACCGGGATGTCTCCTGACCAGGTGCTCCTC 82 8 

II MMMM MMMM llllll Ml I I III 1 1 II 1 1 M II 1 1 II 1 1 II II 

Db 984 GCTGGTGGTGGTGGAGGAAAGTCATCAATGATCAGCATGCCTCCTGACCAGGTGCTCCTC 104 3 

Q y 829 CTGCTCCAGGGTGCCGAGCCCCCAATGCTGTGCTCCCGTCAGAAGCTGAGCCGACCGTAC 88 8 

IIIIIIIIIIIIII II 1 1 II 1 1 1 Ml 1 1 1 1 1 1 1 1 1 M 1 1 M 1 1 II 1 1 1 MMII III 

Db 1044 CTGCTCCAGGGTGCAGAGCCCCCAATGCTGTGCTCCCGTCAGAAGCTGAACCGACCCTAC 1103 

Qv 88 9 ACCGAGGTCACCATAATGACACTACTCACCAGCATGGCCGATAAGGAGCTGGTCCACATG 948 

II I Ml MM MM MMI M llllll Ml Ml II M I II M 1 1 M M I M M I M 

Db 1104 ACCGAGGTCACCATGATGACCCTGCTCACCAGCATGGCTGATAAGGAGCTGGTCCACATG 1163 

Qy 949 ATCACCTGGGCCAAGAAGCTTCCAGGTTTCCTGCAGCTGTCTCTCCATGACCAGGTGCAG 1008 
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Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



III I Mill I III III III III I II MM I Illlll Mill I III MIMI II 

1164 ATCGCATGGGCAAAGAAGCTTCCAGGTTTCCTAC AGCTGTCCCTCCACGACC AGGTGCAG 1223 
10 09 CTGCTGGAGAGCTCGTGGCTGGAGGTGCTGATGATTGGGCTCATATGGAGGTCCATCCAC 1068 

1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 M 1 1 1 II 1 1 Ml I M 1 1 1 IIIIIMI M 1 1 1 1 1 II 1 1 M 1 1 

1224 CTGCTGGAGAGCTCGTGGCTGGAGGTGCTGATGATCGGGCTCATCTGGAGGTCCATCCAC 12 83 
1069 TGCCCCGGCAAACTCATCTTCGCACAGGACCTCATACTGGACAGGAATGAAGGTGACTGT 1128 

IIIIIIMIIIIIIIIIIIIIIIMIIIIMMIIIIIIIIIIMI Mill Illlll 

1284 TGCCCCGGCAAACTCATCTTCGCACAGGACCTCATACTGGACAGGAGCGAAGGCGACTGT 1343 
1129 GTGGAAGGCTTTGTTGAGATCTTCGACATGCTGCTGGCCACTGCCTCCCGCTTCCGCATG 1188 

M Illlll I I II I II Ml MM II Illlll I Mill I Ml II 1 1 III MM II 

1344 GTCGAAGGCATGGCTGAGATCTTCGACATGCTGCTTGCCACCGCCTCCCGTTTCCGCATG 14 03 
1189 CTCAAACTCAAACCTGAGGAGTTTGTCTGCCTCAAAGCTATC ATCCTGCTCAACTCTGGT 1248 

I MMM Illlll 1 1 1 1 1 1 1 1 1 1 MM Ml Ml Illlll Mill II 1 1 1 1 1 1 1 1 II II 

14 04 CTCAAACTCAAACCTGAGGAGTTCGTCTGCCTCAAAGCTATCATCTTGCTCAACTCTGGT 1463 
124 9 GCCTTCTCTTTCTGCACCGGCACAATGGAGCCCCTCCACAACAGCATGGCAGTGCAGAAC 13 08 

Mi M M 1 1 MMM I MMM I MM I M I M 1 1 MM 1 1 II I MM II 

1464 GCCTTCTCTTTCTGCACCGGCACAATGGAGCCCCTCCACGACGGTGCAGCAGTGCAGAAC 1523 
13 09 ATGCTGGACACCATCACAGACGCTCTCATACATCATATCAGCCAATCAGGATGCTCGGCT 1368 

Mill I M I II 1 1 1 1 1 M MM 1 1 II 1 1 II 1 1 1 MM MMM 1 1 1 1 Ml 1 1 MM 

1524 ATGCTCGACACCATCACCGACGCTCTCATACATCACATCAACCAATCCGGATGCTCGGCT 1583 



1369 



CAGCAGCAGTCGAGGCGGCAGGCCCAGCTGCTGCTCCTGCTCTCCCACATCAGGCACATG 1428 

Ml Ml 1 1 M I II I Ml I Ml II I II I III II 1 1 1 II Ml Ml I II I M I II MMM 

15 84 CAGCAGCAGTCGAGGCGGCAGGCCCAGCTGCTCCTCCTGCTCTCCCACATCAGACACATG 164 3 



1429 AGCAACAAAGGCATGGAGCATCTCTACAGCATGAAGTGCAAGAACAAAGTGCCTCTTTAC 14 88 

I M M 1 1 1 1 II M 1 1 M 1 1 M 1 1 1 M 1 1 M 1 1 M 1 1 II 1 1 1 1 1 1 1 M IIIIMM Ml 

1644 AGCAACAAAGGCATGGAGCATCTCTACAGCATGAAGTGCAAGAACAAGGTGCCTCTGTAC 1703 
148 9 GACCTTCTGCTGGAGATGTTGGACGCTCACCGTATACACCGCCCAGACAGACCAGCTCAG 154 8 

Mill 1 1 II 1 1 1 1 M 1 1 MMMMIMM II MMM I MIMMIMM M 

17 04 GACCTGCTGCTGGAGATGCTGGACGCTCACCGCATCCACCGCGCCGACAGACCAGCTGAG 17 63 



1549 



TTCTGGTCCCAGGCTGACGGAGAGCCTCCCTTCATTAACAACAACAACAGCAGCAACAGT 16 0 8 

M 1 1 Ml II I Ml II I 1 1 Ml II I M I M 1 1 II MM MM I II 

17 64 ACCTGGTCCCAGGCTGACAGAGAGCCTCCCTTCACCTCCAGAAACAGCAGCGGTGGTGGT 18 23 
1609 GGCAGCAATGGCGGCGTCTCCTCTTCAGTCGGTTCCAGTTCAGGACCCCGAGTCAACCAC 1668 

M I Ml II MIMMIMM 1 1 II II 1 1 M 1 1 II I 1 1 1 1 M I M I 

1824 GGTGGTGGTGGAGGAGGCTCCTCTTCAGCTGGCTCCACTTCAGGACCACGAGTCAGCCAT 1883 
166 9 GAGAGCCCGAGCAGAGGACCCACAGGTCCAGGAGTCCTGCAGTACGGAGGGTCCCGCTCT 1728 

MMM MMM MM II II 1 1 1 1 M I II 1 1 M 1 1 1 1 1 1 1 1 1 1 1 

1884 GAGAGC CCCACAAGTCCTGGTGTCCTGCAGTACGGAGGGTCCCGCTCC 1931 

1729 GACTGCACCCACATCCTA 1746 

1 1 Mill MM 

193 2 GAGTGCACTCACATCCTA 194 9 
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< i --StartFragment-->RESULT 3 
ESR1 PAGMA 



ID ESR1_PAGMA STANDARD; PRT; 581 AA. 

AC 042132; 

DT 15-JUL-1998, integrated into UniProtKB/Swiss-Prot . 

DT 01-JAN-1998, sequence version 1. 

DT 30-MAY-2006, entry version 47. 

DE Estrogen receptor (ER) (Estradiol receptor) (ER-alpha) . 

GN Name=ESRl; Synonyms=ESR / NR3A1 ; 

OS Pagrus major (Red sea bream) (Chrysophrys major) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii ; Neopterygii; Teleostei; Euteleostei ; Neoteleostei ; 

OC Acanthomorpha; Acanthopterygii ; Percomorpha; Perciformes; Percoidei; 

OC Sparidae; Pagrus. 

OX NCBI_TaxID=143350 ; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [MRNA] . 

RC TISSUE=Liver; 

RA Touhata K. , Kinoshita M. , Toyohara H., Sakaguchi M. ; 

RT "Sequence and expression of a cDNA encoding the red seabream estrogen 

RT receptor."; 

RL Fish. Sci. 64:131-135(1998). 

CC -!- FUNCTION: The steroid hormones and their receptors are involved in 

CC the regulation of eukaryotic gene expression and affect cellular 

CC proliferation and differentiation in target tissues. 

CC -!- SUBUNIT: Binds DNA as a homodimer. Can form a heterodimer with ER- 

CC beta (By similarity) . 

CC -!- SUBCELLULAR LOCATION: Nucleus. 

CC -!- DOMAIN: Composed of three domains: a modulating N-terminal domain, 
CC a DNA-binding domain and a C- terminal steroid-binding domain. 

CC -!- SIMILARITY: Belongs to the nuclear hormone receptor family. NR3 
CC subfamily. 

CC -!- SIMILARITY: Contains 1 nuclear receptor DNA-binding domain. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; AB007453; BAA22517.1; -; mRNA. 

DR HSSP; P03372; 1ERR . 

DR SMR; 042132; 140-212. 

DR InterPro; IPR0 0162 8; Hrmn_rcpt_DNA_bd . 

DR InterPro; IPR000536; Hrmn_rcpt_lig_bd . 

DR InterPro; IPR0 0 8 94 6 ; Nucl_rcpt_lig_bd . 

DR InterPro; IPR001292; Oestr_rcpt_AFl . 

DR InterPro; IPR012239; Oestrogen_rcpt . 

DR InterPro; IPR001723; Str_hrmn_rcpt . 

DR InterPro; IPR000324; VitD_rcpt . 

DR Pfam; PF00104; Hormone_recep ; 1. 

DR Pfam; PF02159; Oest_recep; 1. 

DR Pfam; PF00105; zf-C4; 1. 

DR PIRSF; PIRSF002527; ER-ab; 1. 

DR PRINTS ; PRO 0 3 9 8 ; STRDHORMONER . 

DR PRINTS; PR00047; STROIDFINGER . 

DR PRINTS; PR00350; VITAMINDR . 

DR ProDom; PD000035 ; Znf_C4steroid; 1 . 

DR SMART; SM00430; HOLI ; 1. 

DR SMART; SM0 03 99 ; ZnF_C4 ; 1 . 

DR PROSITE; PS0 0031; NUCLEAR_REC_DBD_1 ; 1. 

DR PROSITE; PS51030; NUCLEAR__REC_DBD_2 ; 1. 

KW DNA-binding; Lipid-binding ; Metal -binding ; Nuclear protein; Receptor; 

KW Steroid-binding; Transcription; Transcription regulation; Zinc; 
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KW 


Zinc-finger 








FT 


CHAIN 


1 


581 


Estrogen receptor. 


FT 








/FTId=PRO_0000053635 . 


FT 


DNA_BIND 


145 


210 


Nuclear receptor. 


FT 


ZN_FING 


145 


165 


NR C4-type. 


FT 


ZN_FING 


181 


205 


NR C4-type. 


FT 


REGION 


1 


144 


Modulating (By similarity) . 


FT 


REGION 


211 


272 


Hinge. 


FT 


REGION 


273 


581 


Steroid-binding (By similarity) . 


FT 


COMPBIAS 


261 


265 


Poly-Gly . 


FT 


COMPBIAS 


278 


281 


Poly-Leu. 


FT 


COMPBIAS , 


469 


473 


Poly-Leu. 


FT 


COMPBIAS 


537 


545 


Poly-Gly. 


SQ 


SEQUENCE 


581 AA; 


63461 MW 


; D499BB6E7B4EB964 CRC64 ; 


Query Match 




89.1%; 


Score 2752.5; DB 1; Length 581; 



Best Local Similarity 89.1%; Pred. No. 1.7e-189; 

Matches 522; Conservative 22; Mismatches 33; Indels 9; Gaps 4; 



Qy 1 MYPEESRGSGGVATVDFLEGTYDYAAPTPAPTPLYSQS - -GYYSVPLDAQGPPSDGSLQS 58 

1 1 1 hll I II I II II 1 1 II 1 1 Ml 1 1 M 1 1 1! 1 1 II I MM I II I 1 1 M 1 1 1 II I 

Db 1 MYPEDSRGSGGVATVDFLEGTYDYAAPTPAPTPLYSHSTPGYYSAPLDAHGPPSDGSLQS 6 0 

Qy 59 LGSGPTSPLVFVPSSPRLSPFMHPPSHHYLETTSTPVYRSSVPSSQQPVPREDQCATSDE 118 

Mill 1 1 1 1 1 II II 1 1 1 1 1 1 1! I hi 1 1 1 1 1 1 1 1 1 1 Ml 1 1 M 1 1 1 I Mill MM 

Db 61 LGSGPNSPLVFVPSSPRLSPFMHPPTHHYLETTSTPVYRSSVPSSQQSVSREDQCGTSDD 12 0 



Qy H9 SYSVGESGAG- -ARGFEMAKEMRFCAVCSDYASGYHYGVWSCEGCKAFFKRSIQGHNDYM 176 

M I II II M I I II I = I M I I I I I I I I I I II I I I II I I M I I I I I I I I I I I I I I I I I I 
Db 121 SYSVGESGAGALAAGFEIAKEMRFCAVCSDYASGYHYGVWSCEGCKAFFKRSIQGHNDYM 18 0 

Qy 177 CPATNQCTIDRNRRKSCQACRLRKCYEVGMMKGGVRKDRGRVLRRDKRRAGTNDREKASK 23 6 

I | | | | | I II I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I h I I I - I I - I I I I 
D b 181 CPATNQCTIDRNRRKSCQACRLRKCYEVGMMKGGMRKDRGRVLRRDKQRTGTSDRDKASK 240 

t 

Qv 23 7 DLEYKTVPPQDRRKHSSSSSAGGGGGKSSVTGMSPDQVLLLLQGAEPPMLCSRQKLSRPY 2 96 

MM 1 1 1 1 1 1 1 1 111111111111= I IIIIIMIMIMIIMIIIIMIII 

Db 241 GLEHRTAPPQDRRKH-ISSSAGGGGGKSSMISMPPDQVLLLLQGAEPPMLCSRQKLNRPY 299 

Qy 297 TEVTIMTLLTSMADKELVHMITWAKKLPGFLQLSLHDQVQLLESSWLEVLMIGLIWRSIH 356 

I | | | : | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3 00 TEVTMMTLLTSMADKELVHMIAWAKKLPGFLQLSLHDQVQLLESSWLEVLMIGLIWRSIH 359 

Qy 357 CPGKLIFAQDLILDRNEGDCVEGFVEIFDMLLATASRFRMLKLKPEEFVCLKAIILLNSG 416 

I II I I I II I II I I I M I I I I II I II I I I I I II II II II M M I I II II I I II I I II M 
Db 360 CPGKLIFAQDLILDRSEGDCVEGMAEIFDMLLATASRFRMLKLKPEEFVCLKAIILLNSG 419 

Qy 417 AFSFCTGTMEPLHNSMAVQNMLDTITDALIHHISQSGCSAQQQSRRQAQLLLLLSHIRHM 476 

1 1 1 II II II 1 1 lh I II 1 1 III 1 1 Ml 1 1 1 hll 1 1 II 1 1 II 1 1 MM 1 1 1 1 III I II 

Db 42 0 AFSFCTGTMEPLHDGAAVQNMLDTITDALIHHINQSGCSAQQQSRRQAQLLLLLSHIRHM 479 

Qy 477 SNKGMEHLYSMKCKNKVPLYDLLLEMLDAHRIHRPDRPAQFWSQADGEPPFINNNNSSNS 536 

I M M 1 1 II M 1 1 1 1 1 M I M 1 1 1 M 1 1 1 1 II 1 1 IMM Mill MM = Ml 

Db 480 SNKGMEHLYSMKCKNKVPLYDLLLEMLDAHRIHRADRPAETWSQADREPPFTSRNSSGGG 53 9 

Qy 537 GSNGGVSSSVGSSSGPRVNHESPSRGPTGPGVLQYGGSRSDCTHIL 582 

I M Ml IMIIMMIII II 1 1 1 II 1 1 1 1 1 Ml 1 1 1 1 

Db 540 GGGGGGSSSAGSTSGPRVSHES PTSPGVLQYGGSRSECTHIL 581 
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